LinkScope: Interactive Graph Analysis of Unstructured Text
نویسندگان
چکیده
This paper presents LinkScope, a toolkit for interactive analysis of text using node link graphs, with support for dynamic addition of attributes from tabular data. The interaction technique draws on ideas from 3D modeling, mesh deformation, and static graph drawing to promote discovery of hidden information across a wide variety of graph types and analysis tasks. The key innovation of this work is the application of methods traditionally reserved for automated graph layout and clustering, to produce useful task-specific layout through dynamic interactions. Graph nodes are dynamically repositioned using an interpolated decay function over a single node movement provided by a user. We describe several variants of the interpolation method, including coupling it with a fast local-cut algorithm for cluster selection. Compared to traditional layout mechanisms the technique is particularly useful when meta-data nodes are added to a graph, increasing its connectivity. We show how the techniques can be used interactively to solve text analysis tasks including a case study on a collection of 16K awarded NSF grant proposals with metadata and a corpus of New York Times news articles.
منابع مشابه
TextTile: An Interactive Visualization Tool for Seamless Exploratory Analysis of Structured Data and Unstructured Text
We describe TextTile, a data visualization tool for investigation of datasets and questions that require seamless and flexible analysis of structured data and unstructured text. TextTile is based on real-world data analysis problems gathered through our interaction with a number of domain experts and provides a general purpose solution to such problems. The system integrates a set of operations...
متن کاملAkshaya: A Framework for Mining General Knowledge Semantics From Unstructured Text
We report a tool called Akshaya, which implements a framework to mine four types of “general knowledge semantics” (analytical semantics) from unstructured text. The semantics being mined are semantic siblings, topical anchors, topic expansion and topical markers. The framework provides options to embed more such general knowledge semantic mining algorithms into it. We use a term co-occurrence g...
متن کاملInteractive Information Extraction and Navigation to Enable Effective Link Analysis and Visualization of Unstructured Text
This paper describes the Advanced Text Exploitation Assistant (ATEA), a system developed to enable intelligence analysts to perform link analysis and visualization (A&V) from information in large volumes of unstructured text. One of the key design challenges that had to be addressed was that of imperfect Information Extraction (IE) technology. While IE seems like a promising candidate for explo...
متن کاملخوشهبندی اسناد مبتنی بر آنتولوژی و رویکرد فازی
Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...
متن کاملIn-depth Interactive Visual Exploration for Bridging Unstructured and Structured Document Content
Semi-structured data refers to the combination of unstructured and structured data. Unstructured data is free text in natural language, while structured data is typically stored in tables and following a data schema. Recent statistics shows that 80% of the data generated in the last two years is unstructured. However, one interesting observation is that free text usually comes along with some s...
متن کامل